Increment definitions for scale dependent analysis of stochastic data 
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It is common for scale-dependent analysis of stochastic data to use the increment A(t,r) = 
£(t + r) — £(t) of a data set £(t) as a stochastic measure, where r denotes the scale. For joint 
statistics of A(t,r) and A(t,r') the question how to nest the increments on different scales r,r' is 
investigated. Here we show that in some cases spurious correlations between scales can be introduced 
by the common left-justified definition. The consequences for a Markov process are discussed. These 
spurious correlations can be avoided by an appropriate nesting of increments. We demonstrate this 
effect for different data sets and show how it can be detected and quantified. The problem allows 
to propose a unique method to distinguish between experimental data generated by a noiselike or a 
Langevin-like random-walk process, respectively. 

PACS numbers: 02.50.-r, 05.10.-a, 95.75.Wx 
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I. INTRODUCTION 

The complexity of most disordered systems depends 
on the scale at which they are observed. Therefore, 
stochastic analysis of those systems uses scale-dependent 
quantities for their characterization. The term "scale" 
here means for a data set the distance r between 
two arbitrary points t, t' with t' — t = r (t may denote 
time as well as space in this context). The increment 
A(t, r) — £(£ + r) — £(t) is a common scale-dependent 
measure of complexity and disorder. Well-known exam- 
ples for other scale-dependent measures of complexity are 
the autocorrelation function R(r) — (£(t)£(t + r)), the 

rms width w r {t) — ([£(£) — i] 2 )V 2 , or wavelet functions. 

Traditionally, the investigation of statistical proper- 
ties is performed on distinct scales, e.g., by means of 
the structure functions (A(i, r) n ) given by the probabil- 
ity density functions (PDF) p(A\(t, r)). An advanced ap- 
proach is to try to describe the joint statistics of the cho- 
sen measure on many different scales. This is achieved by 
the knowledge of the joint PDF p(A(t,n); . . . ; A(t,r„)). 
By these joint PDF also the correlations between scales 
are worked out, showing how the complexity is linked 
between scales. 

If the statistics of the scale-dependent measure can be 
regarded as a Markov process evolving in r, the knowl- 
edge of two-scale conditional PDF is sufficient for a com- 
plete description of multiscale joint PDF p| . The condi- 
tional PDF p(Ai(£, ri)|Ao(i, ro)) denotes the probability 
of finding an increment A(i, ri) = Ai on the scale r\ un- 
der the condition that at the same time t on a different 
scale ro another increment A(t, ro) = Ao has been found. 
The validity of the Markov propertycan be tested by the 
investigation of conditional PDF 2], of the Chapman- 



Kolmogorov equation or of reconstructed noise Q|. 
If, furthermore, the noise involved in the process is Gaus- 
sian distributed, the whole joint statistics can be grasped 
by a Fokker-Planck or Langevin equation 
This approach has been used by different researchers in 
a number of applications 0,0,I1jIEIEI3)ISI3- I n some 
cases also the question of increment definitions has been 
discussed, as in Q. 

In this paper we want to address the question of 
whether the relative location of the increments may in- 
troduce spurious correlations between different scales. 
In particular, we investigate the two cases of the left- 
justified increment 

A l (t,r)=£(t + r)-m (1) 

and the centered increment 

A c (i,r)=£(i + r/2)-£(i-r/2). (2) 

We will discuss the implications of these increment defi- 
nitions on two different types of stochastic processes and 
compare the results for experimental data. 



II. INCREMENT DEFINITIONS AND 
CORRELATIONS BETWEEN SCALES 

First, two idealized types of stochastic processes for 
discrete times are defined. The first one is called white- 
noise-like random walk (WNR) with the random variable 
given by 
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which implies that 



and 
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cr 2 kk (4) 
for k > i. (5) 
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FIG. 1: Conditional PDF of (a) left-justified and (b) centered increments of a white-noise-like random walk. Scales ro and 
ri are 116 and 100 sample steps, respectively. PDF are displayed as contour lines, the levels differ by a factor of 10, with an 
additional level at 0.3. 



The second model, called Langevin-like random walk 
(LRW), is the cumulative sum of the first one, i.e., 



III. CONSEQUENCES FOR MARKOV 
PROPERTIES 



(6) 



with w(tk) distributed as in 0. This implies that here, 

(£(*0£(*fc)) = forfc>z (7) 

and (£(*fc)|f = zo) = 2:0 for k > i. (8) 

The correlations between different scales for both in- 
crement functions A;(t, r) and A c (t, r) can be calcu- 
lated easily. Taking increments of the WNR © and 
ri > ro > we get 

(A,(t,r )Ai(t,n)) =a 2 , (A c (t,r )A c (t,n)> = . (9) 

Note that for arbitrary different scales of A; correlations 
are present even though there are no correlations at all in 
the data These spurious correlations are introduced 
by the left-justified increment because for a fixed value t 
the increments Ai(t, r) of all scales r have the term 
in common, see Eq. For the LRW we obtain 

(A,(i,rb)Ai(f,ri)) = (A c (t, r ) A c (t, r x )) = r 0( x 2 (10) 

with r\ > tq > 0. In contrast to the WNR, identical 
correlations of both increment definitions result here. 

Correlations between two scales r ,ri can directly be 
observed as dependence of p(Ai|Ao) on Ao. Increment 
statistics of the WNR are presented in Fig. ^ as condi- 
tional pdf of left-justified and centered increments. Data 
have been generated using the routine gasdev from 
and normalized by a. As expected from Eq. ©, for 
the left-justified increments (a) a correlation between 
both scales 7*1, ro is evident because p(Ai(t, ri)\Ai(t, ro)) 
strongly depends on the value of Ai(t, ro). In contrast, 
the conditional PDF of the centered increments (b) is 
independent of A c (t,ro) and thus both scales are uncor- 
rected for centered increments. For corresponding dia- 
grams calculated for the LRW process (not shown here) 
we find identical PDF for left-justified and centered in- 
crements, similar to Fig. ^a), and in accordance with 
Eq. O- 



For the analysis and description of stochastic data by 
means of a Fokker-Planck equation, as mentioned in the 
Introduction, the underlying process has to be Marko- 
vian. In the previous section we have shown that the left- 
justified increment definition can introduce additional 
correlations. This effect also influences the Markov prop- 
erties of these increments. 

It is indeed straightforward to see that the left -justified 
increment A/(t, r) of a WNR is consequently not a 
Markov process in the scale variable r. A necessary con- 
dition for a stochastic process to be Markovian is the 
Chapman-Kolmogorov equation pj (here we use the no- 
tation Ai(t,n) = Aj,i) 



p{Ai, 3 \A 



p(A,,3|Aj i 2)p(A, )2 |Ai i i)dA z ,2 (11) 




for any triplet r\ < r 2 < r%. 

For the WNR process we first derive from and 
© the correlation matrix S for A^i, Aj^ with elements 
Sij = (A M A Zj ), i,j = 1,2: 



(12) 



Because the difference of two Gaussian distributed 
random variables is also Gaussian, we can derive two- 
dimensional PDF of Aj^jA;^ using the general two- 
dimensional form of the Gaussian distribution (after [l|) 

(13) 

Using (|12fl and (|13fl . we can now explicitly calculate both 
sides of Eq. I|ll|) , namely 



^(A^IA,,!) = 



exp JA,3-A„/2)- (u) 
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FIG. 2: Conditional PDF of left-justified (solid lines) and centered (broken lines) increments of experimental data, (a) Height 
profiles from a smooth asphalt road using scales ro = 137, r\ — 104 mm , (b,c) velocity time series from a turbulent free jet, 
with small (b) and large (c) scale differences Sr — ro — ri (see text). PDF are plotted as in Fig. 



and 

/DO 
p(A, i3 |Ai l2 )p(A, i a|Ai ) i)dA,,2 = 
-OO 

1 (A t|3 - A M /4) 2 

V^W^ (15/4)^ ' [ ' 

Obviously the Chapman-Kolmogorov equation (|ll|l is vi- 
olated for left-justified increments of the WNR on any 
scales r\ < T2 < r^. The same procedure can be used 
to see that for centered increments the Markov property 
holds. 



IV. INDICATORS FOR SPURIOUS 
CORRELATIONS CAUSED BY INCREMENT 
DEFINITION 

The question if, or if not, the above-mentioned spuri- 
ous correlations between different scales are introduced 
by the increment definition is of practical importance for 
the analysis of measured data. For data which behave 
like the LRW, i.e., (£(t + r) | £(t) = xq) — xq rather than 
+ r ) I f (*) = ^o) =0 as for the WNR, the increment 
definition should be unimportant. No spurious correla- 
tions would be created in either case. In contrary, for 
data which behave more like the WNR, the increment 
definition should be more important. 

As shown above, the conditional PDF can serve as a 
means to discriminate between true and spurious corre- 
lations between scales if we compare conditional PDF of 
left-justified and centered increments. In Fig. [3 condi- 
tional PDF are shown for these increments of two ex- 
perimental data sets. Figure [SJa) displays PDF of both 
increment types obtained from surface height profiles of a 
smooth asphalt road. The distance between consecutive 
data points is 1.04 mm; further details of the measure- 
ment are found in [jjj ■ The difference between both 
types of increments is evident and similar to that for the 
WNR in Fig.U In Figs.^b) and 2(c) conditional PDF 
are shown in the same manner for velocity increments 



measured in a turbulent free jet at Re = 2.7 x 10 4 (for 
details see Q). In both cases scale r% is L/2. The scale 
difference Sr = ro — r\ is small (1.5 A) for (b) and large 
(L) for (c) 12]. It can be seen in Fig.^b) that here con- 
ditional PDF of left-justified and centered increments are 
identical and the increment definition does not influence 
the statistics. For Sr = L at the end of the inertial range, 
Fig. Efc) , a slight difference of both conditional PDF has 
already occurred. The definition of L [l2j provides that 
for large scale differences Sr > L a transition to a noise- 
like behavior as in figs.|2Ja) and^can be expected. Nev- 
ertheless, only a small fraction of the correlation between 
the scales r$,ri is detected as spurious here. 

As a second indicator the conditional expectation value 

T(r,a;o) = <A,(t,r)|£(t) = a!o) (16) 

can be estimated from the measured data. T(r, xo) quan- 
tifies the influence of the value £(t) = xq on the left- 
justified increment A/(t,r). It follows immediately that 
T(r,x ) = (t(t+r)\£(t) = x )-xo. Witheqs. © and © 
we obtain the ideal cases T(r,xo) — —xo for the WNR 
and T(r,xo) = for the LRW. If for experimental data 
there is a strong dependence of T on xq the data must 
be regarded as noiselike in the sense of the WNR in the 
respective length scale, and for scale-dependent analysis 
the use of left-justified increments is not appropriate. If 
otherwise T is independent of xq the data behave like a 
LRW, and thus the increment definition is not important. 

In Fig. we present the dependence of T on £(i) = xo 
for different data sets. Data of both ideal cases were 
generated as in Fig. ^ Turbulence and asphalt road data 
have already been shown in Fig. [3 As expected, we see 
that for the LRW there is no dependence of T(r, xq) on 
xq. In contrast, for the WNR as well as for the surface 
data the dependence is clear with T(r, xq) = —xq. For 
the turbulent velocity increments it can be seen that on 
the small scale Ar = A the influence of xq on T is only 
small, while on the large scale Ar = L the dependence 
is more pronounced. This finding corresponds to Fig. [2 
where for small scales [Fig. EJb)] the conditional PDF 
of left-justified and centered increments were identical, 
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(A t (t,r) | £(t) — xo) for different data sets, 
function of xo for fixed scales r. 
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T is shown as 



while for large scales [Fig. [He)] a difference occurred. 



V. CONCLUSIONS 

We found that for scale-dependent analysis of stochas- 
tic data, where the connections between different scales 
are investigated using increment statistics, the definition 
of the increment can be important, depending on the na- 
ture of the data. Apparent correlations between scales 
may be introduced by the left-justified increment. The 
importance of the increment definition varies between the 
ideal cases of the LRW (0, where it is nonrelevant, and 
the WNR ©, where it is crucial, fn this case the use of 
left-justified increments leads to biased results for corre- 



lations between different scales. Especially, the surface 
measurement data we have studied require the centered 
definition on all accessible scales 0,^3 • For turbulent ve- 
locities this influence depends on the regarded length (or 
time) scale r. In previous publications pi IbH] no signifi- 
cant difference between the drift and diffusion coefficients 
of the Fokker-Planck equation of A; and A c was found. 
This is in accordance with our findings in Fig.[2Ib), where 
the PDF of A; and A c are shown to be identical for small 
scale differences ro — r\ , and only at the integral length 
scale a difference occurs [see Fig. Et c )] ■ Detailed conse- 
quences are currently being investigated [l5|. 

The conditional expectation value T(r,Xo) allows to 
quantify the influence of a left-justified increment. Nev- 
ertheless, the specification of a threshold in a statistically 
meaningful way is still an open question. 

While in this paper we used the increments Q an d 
(0) to demonstrate the introduction of spurious correla- 
tions, we expect that these considerations can be applied 
to general scale-dependent measures of complexity, such 
as the rms width w r (t) = ((£(£) — £) 2 )r^ 2 or wavelet func- 
tions. One could generally distinguish between measures 
which are orthogonal on different scales and those which 
are not fl6| . We expect similar results for correlations 
between scales as demonstrated here for left-justified and 
centered increments. 
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